ClaPIM: Scalable Sequence Classification Using Processing-in-Memory

نویسندگان

چکیده

Deoxyribonucleic acid (DNA) sequence classification is a fundamental task in computational biology with vast implications for applications such as disease prevention and drug design. Therefore, fast high-quality classifiers are significantly important. This article introduces ClaPIM, scalable DNA architecture based on the emerging concept of hybrid in-crossbar near-crossbar memristive processing-in-memory (PIM). We enable efficient by uniting filter search stages within single algorithm. Specifically, we propose custom filtering technique that drastically narrows space approach facilitates approximate string matching through distance function. ClaPIM first PIM benefits from high density crossbar arrays massive parallelism PIM. Compared Kraken2, state-of-the-art software classifier, provides higher quality (up to $20 \times $ improvement F1 score) also demonstrates notation="LaTeX">$1.8 throughput improvement. edit tolerant (EDAM), recently proposed static random-access memory (SRAM)-based accelerator restricted small datasets, observe both notation="LaTeX">$30.4 normalized per area 7% increase precision.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scalable In-Memory Transaction Processing with HTM

We propose a new HTM-assisted concurrency control protocol, called HTCC, that achieves high scalability and robustness when processing OLTP workloads. HTCC attains its goal using a two-pronged strategy that exploits the strengths of HTM. First, it distinguishes between hot and cold records, and deals with each type differently – while accesses to highly contended data are protected using conven...

متن کامل

Scalable Distributed Query Processing in Parallel Main-Memory Database Systems

The continuous increase in compute speed and main-memory capacity of modern servers triggered the development of a new generation of in-memory database systems. These systems completely rewrote the traditional database architecture to use main memory as primary storage. Discarding several now obsolete abstractions of disk-based database systems enabled unprecedented query performance on a singl...

متن کامل

Memory Matching Networks for Genomic Sequence Classification

When analyzing the genome, researchers have discovered that proteins bind to DNA based on certain patterns on the DNA sequence known as “motifs”. However, it is difficult to manually construct motifs due to their complexity. Recently, external learned memory models have proven to be effective methods for reasoning over inputs and supporting sets. In this work, we present memory matching network...

متن کامل

Scalable Packet Classification Using Condensate Bit Vector

Packet classification is important in fulfilling the requirements of differentiated services in next generation networks. One of interesting hardware solutions proposed to solve the packet classification problem is bit vector algorithm. Different from other hardware solutions such as ternary CAM, it efficiently utilizes the memories to achieve an excellent performance in medium size policy data...

متن کامل

Learning Interpretations Using Sequence Classification

In this paper we present a system that assigns interpretations, in the form of shallow semantic frame descriptions, to natural language sentences. The system searches for relevant patterns, consisting of words from the sentences, to identify the correct semantic frame and associated slot values. For each of these choices, a separate classifier is trained. Each classifier learns the boundaries b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Very Large Scale Integration Systems

سال: 2023

ISSN: ['1063-8210', '1557-9999']

DOI: https://doi.org/10.1109/tvlsi.2023.3293038